Universal Networking Language Based Analysis and Generation for Bengali Case Structure Constructs

Authors

  • Kuntal Dey
  • Pushpak Bhattacharyya
Abstract

Case structure analysis forms the foundation for any natural language processing task. In this paper we present a computational analysis of the complex case structure of Bengali, a member of the Indo-Aryan family of languages, with a view toward interlingua-based machine translation (MT). Bengali ranks fourth among languages ordered by number of speakers. Extremely interesting language phenomena involving morphology, case structure, word order and word senses make the processing of Bengali a worthwhile and challenging proposition. A recently proposed scheme called the Universal Networking Language (UNL) is used as the interlingua. The approach is adaptable to other members of the vast Indo-Aryan language family. The parallel development of the analyzer and the generator provides an insightful intra-system verification process. Our approach is rule based and makes use of authoritative treatises on Bengali grammar.

Similar resources

Problems and Prospects: Universal Networking Language on Bangla Sentence Structure Perspective

The World Wide Web (WWW) is the most effective communication medium nowadays. The WWW represents a revolutionary tool to communicate and access information. It enables us to access innumerable documents about a huge variety of topics, from any place around the world. However, despite the abundance of information, languages very often cause problems. When most of the web pages today are written in f...

Structure of Dictionary Entries of Bangla Morphemes for Universal Networking Language (UNL)

This paper describes a structure of dictionary entries of Bangla (widely known as Bengali) morphemes for Universal Networking Language (UNL). The UNL is an artificial language developed for conveying linguistic expressions in order to represent website information in a standard form. In order to integrate Bangla into this platform it is necessary to develop both a dictionary and a grammar, wh...

Query Focused Summary Generation System using Unique Discourse Structure

In this paper, the authors propose a query focussed summary generation system which is constructed on top of a unique language-independent discourse structure. The discourse structure is comprised of three text representation techniques, namely, Universal Networking Language (UNL), Rhetorical Structure Theory (RST) and saṅgatis. The discourse structure is indexed based on a concept called sūtra...

Design of a Rule-based Stemmer for Natural Language Text in Bengali

This paper presents a rule-based approach for finding out the stems from text in Bengali, a resource-poor language. It starts by introducing the concept of orthographic syllable, the basic orthographic unit of Bengali. Then it discusses the morphological structure of the tokens for different parts of speech, formalizes the inflection rule constructs and formulates a quantitative ranking measure...

Generation of Bangla Text from Universal Networking Language Expression

This paper presents work on generating Bangla sentences from an interlingua representation called Universal Networking Language (UNL). UNL represents knowledge in the form of semantic-network-like hypergraphs that contain disambiguated words, binary semantic relations, and speech-act-like attributes associated with the words, assisted by the semantically rich lexicon and a set of analysis ...

Publication date: 2005